Deterministic Strategies for Query (re)formulation in Information Retrieval
نویسنده
چکیده
Relevance feedback is a process whereby a user examines the documents selected by a retrieval system and provides feedback t o the system as t o their relevance. Such a feedback can then be used t o formulate an optimal query with respect t o the current infwmation need of the user. This process of query (re)formulation can be based on probabilistic concepts, where Bayesian decision theory provides the framework for a decision rule, or ideas which instead employ deterministic strategies. The former class of techniques are limited by the fac t tha t (strong) assumptions have t o be made concerning the nature of the conditional orobabilitv densitv functions characterizinn the data. In contrast, deterministic techniques, which do not require any explicit assumptions about the distribution of the various descriptor vaiues, can b e adopted. Such methods would have t h e advantages of being %on-parametric" and robust (useful in a wide variety of contexts). A d a s s of deterministic techniques has been advocated by Salton and the SMART project group at Cornell. However, tha t approach d w s not obtain a new q w r y t h a t can be claimed t o be optlmal in a certain sense. In this work, a deterministic method t h a t obtains an optimal query according t o a prescribed criterion is advanced. Furthermore, i t will be demonstrated t h a t such methods a r e applicable not only when there a r e two classes of relevance (relevant and non-relevant) but also when t h e feedback distinguishes documents according t o several degrees of relevance.
منابع مشابه
QEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملBoosting Passage Retrieval through Reuse in Question Answering
Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005